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In this work we study the problem of targeting signals in networks using entropy information 
measurements to quantify the cost of targeting. We introduce a penalization rule that impose a 
restriction to the long paths and therefore focus the signal to the target. By this scheme we go 
continuously from fully random walkers to a walker biased to the target. We found that the optimal 
degree of penalization is mainly determined by the topology of the network. By mean of the 
analysis of several examples we have found that a small amount of penalization reduce considerably 
the typical walk length, and from this we conclude that a network can be efficiently navigated with 
restricted information. 
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I. INTRODUCTION 

In the problem of targeted signaling or targeted naviga- 
bility in a network, a message or vehicle begins a journey 
at a given source vertex with the intention of reaching 
another target vertex in the most efficient way possible- 
it is implicitly assumed that the message or vehicle is 
restricted to jump from vertex to vertex along the edges 
available in the network. Among the many applications 
of this area it can be mentioned, the distant communica- 
tion in complex systems 1 -, problems related to the traffic 
in cities 2 -! 3 -, etc., all this problems make the area an ac- 
tive field of research. The efficiency in solving the prob- 
lem is measured in terms of a cost which is associated to 
each possible path the message/ vehicle can follow in its 
journey from the source to the target. Two main issues 
have to be accounted in defining the cost of targeting, 
on one side the length of the paths and, on the other 
side how difficult is to identify a set of convenient paths 
which connect the source with the target. Once the cost 
is defined, the efficiency in the task of targeted signal- 
ing/navigability can be improved by choosing a conve- 
nient searching strategy that minimize the cost. 

The most basic strategy for searching the target is the 
non-biased or fully random walker. The message or ve- 
hicle moves randomly without bias through the network 
with the hope that eventually will reach the target. In 
this case there is no cost associated in choosing the appro- 
priate path to the target, then the focus is in the determi- 
nation of how long it will take to the message/vehicle to 
reach its target. This is a problem related to the first pas- 
sage times and has been investigated in several paradig- 
matic network models^. It is important to stress here 
that the random walk strategy have to be differentiated 
from network sampling using random walkers* and, from 
non-specific broadcasting where a signal is propagated 
and amplified as in the spreading of diseases, spam or 
computer viruses^ 7 -. 

Other approaches deviates from a fully random walker 
but still dismiss the quantification of the cost of choos- 



ing the appropriated paths from the source to the target. 
Now the problem is focused in finding a reasonable strat- 
egy for searching the target, the election of a particular 
strategy is driven by the minimization of the length of 
the journey from the source to the target. Among the 
different strategies one can mention: self-avoiding ran- 
dom walks^, intermittent random walks on lattices^, the 
consideration of local topological features 1 ^, and greedy 
strategies which are used in the case of networks with spa- 
tial embeddin g 11 ' 12 . A common feature of all the above 
mentioned methods is that they use knowledges about 
the topological structure for the design of the strategies. 

A different approach to the problem consist in evalu- 
ating the difficulty of choosing the appropriated paths. 
The difficulty can be quantified in terms of the amount 
of information or knowledge required to follow these 
paths. In some cases the information is measured ad- 
hoc, for example using a fixed information cost per ver- 
tex traverse d 13 ' 14 . In general the information required 
for choosing the right direction in the network depends 
on the local topological details, for instance; the informa- 
tion required to take the right direction grows with the 
number of available options. Entropy measures provides 
a natural way of quantifying information, in fact this 
measurements have been applied successfully in complex 
networks before^—. In particular, these methods were 
applied recently for quantifying information in the prob- 
lem of targeted signaling/navigabilit y 2 ! 20 ' 21 . This is the 
approach we adopted in our work to quantify the diffi- 
culty of choosing the appropriate paths connecting the 
source with the target. We will focus on strategies that 
may be adapted to any topology and in which the mes- 
sage/vehicle is represented by a biased random walker. 
These strategies will allow us the interpolation between 
a fully random walker which uses no information to reach 
its destiny and a directed walker that travels along the 
shortest paths using all the available information to ori- 
ent itself. In this regard, there are several antecedents 
with strategies that interpolate to some extend the ran- 
dom and the biased regim o 2 i 13 i 20 ~ — 
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In this work we will follow the line of previous 
work o 13 ' 21 in the sense that an information measure is 
used to regulate how directed the walks are. We extend 
the ideas of Refs. [2"||20| since in these works the infor- 
mation is measured considering only the shortest paths 
and the optimal penalization strategy for the achieve- 
ment of relatively short paths is at expenses of limited 
information^. We introduce a formalism for measur- 
ing the amount of information used by a biased random 
walker to reach its target. Using this formalism we de- 
velop a method which depends of one parameter that 
regulates how biased is the random walk. In our method 
the overall information is increased each time the walker 
perform a step, so longer walks results in larger penal- 
ization. Optimizing the walker's information forces the 
walker to travel along increasingly shorter paths or equiv- 
alently. the paths are biased to the target. 

The paper is organized as follow. In section|TT]we intro- 
duce the theoretical background, defining the measures 
of information used by the random walker in going from 
the source to the target. We also introduce the penaliza- 
tion rules used to interpolate between the random and 
the directed regimes, in particular the optimal penaliza- 
tion is defined. In section ITO1 we analyze simple examples 
that can be solved analytically which are useful to under- 
stand how the method works in different topological en- 
vironments, analyzing some limiting cases. In section HVl 
we applied the method by using numerical simulations 
in more complex networks, like a random network and 
a Barabasi-Albert scale-free network model. Finally, in 
section [V] final remarks, conclusions and possible exten- 
sions to our work are discussed. 



II. THE MODEL 

Consider a non-directed network with N vertices and 
M links where a random walker jumps at a given time 
step from a vertex i to a neighbor vertex j with proba- 
bility qij . For each vertex i in the network the transition 
probabilities q,j satisfy the normalization condition, 



(1) 



where rm, is the set of all nearest neighbors vertices of 
vertex i, and qn = for all i, it means the walker is forced 
to move at each time step. The amount of information 
given to the walker — for taking an exit from a given ver- 
tex i to one of its nearest neighbors — is the information 
cost defined by 2 ^ 



ln(fci) 



E 



(2) 



which is the difference between the maximum entropy in 
the space of events of taking one of the exits minus the 
entropy the exits of vertex % already has associated. Here 



ki denotes the degree of vertex i. Let us consider now the 
walker starts its journey at a source vertex named s and 
ends the trip at a vertex we call the target t; furthermore, 
the walker pass during the journey by the vertex i. We 
want to obtain an expression for the information needed 
in going from s to t given a distribution of probabilities 
qij . From the information cost defined above one can de- 
rive a recursive expression for the amount of information 
S(i — > t) used by the walker in going from vertex i to t, 
accordingly this information cost is expressed as: 

S(i ->t) = ln(fa) + ^2 Inqij + ^ Qij s U ~* *)> 



(3) 

hence, with the constrain that S(t — )• t) = 0, i.e., 
no information is needed by the walker once the tar- 
get is reached, a set of linear equations with unknowns 
{S(i —> i)}i=i,...,jv can be defined and solved provided 
the probabilities {qij} are known. A similar approach 
was used by Rosvall et al (2^1, to quantify the amount 
of information needed by a walker which is restricted to 
walk only the shortest paths. In the case that the ran- 
dom walker can step back during the walk the amount of 
information is 21 : 



S sp (s -»• t) = - In 



i-7ren(s,t) jefr 



(4) 



where H(s, t) denotes the set of all shortest paths n be- 
tween s and t, and tt denotes the set of interior vertices 
of the shortest path ir. 

The minimum for the information S(s — > t) introduced 
in Eq. ([3]) , regarding the transition probabilities , cor- 
respond to a fully random walker with probabilities de- 
fined by, 
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fa' 



(5) 



In this case S(i —> t) = for all i. As expected, in finite 
networks the fully random walker needs no information 
for reaching the target but has the drawback of very long 
walks on average. Since we are interested in targeted 
signaling the results obtained above are of little utility. In 
order to fix this problem, we will introduce a penalization 
rule that weights the paths favoring the shortest paths to 
the target. This penalization will modify the transition 
probabilities that minimizes the information required to 
reach the target; the longer walks will be rejected and 
then a random walker that search the network using this 
probabilities will be biased to the target. 

The simplest way of introducing a penalization is by 
paying a cost each time the walker pass through a vertex. 
This information cost is not used by the walker when is 
travelling the network-unlike the information associated 
to the gjj-but it allows the evaluation of intrinsic prop- 
erties of the paths to the target, taking into account the 
whole network. For instance, depending of the degree of 
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penalization needed for reaching an optimal set of path 
one can estimate how difficult is the task of finding the 
paths in a given network. Once the penalization term is 
introduced in equation ([3]) it becomes, 

F y (i -> t) = ln7+ln(fcj)+ ^ q VJ lng. y + 2J qijF y (j -t t) 



(6) 

where the term, In 7, with 7 > 1 is the penalization term. 
Now since 7 > 1, F(s — > t) = is not anymore a mini- 
mum and hence a random walking does not minimize the 
information. As will be shown in the next section, min- 
imizing F(s — > t) with respect to {qij} keeping 7 fixed 
leads to a biased walk which becomes more directed to 
its target as 7 increases. In fact, the fully random walker 
correspond to 7 = 1 and, in the other extreme when 
7 — > 00 the walker is forced to walk along the shortest 
paths. The quantity Fy(s —> t) stands for the amount of 
information the walker uses in going from s to t plus the 
intrinsic information related to the penalization. 

To clarify the role of 7 let us introduce a quantity that 
will allow to define an optimal value for the penalization. 
First of all, we name by {q*j} the probabilities {qij} that 
minimizes F^(s — > t) at a given fixed value of 7, and 
F*(s — > t) the function evaluated at these values, i.e. 
the minimum. Furthermore, S*(s — > t) is the value of 
S(s —> t) evaluated on { We compute the amount of 
information introduced by 7 as F*(s — s- t) — S*(s —> t) 
which is related to intrinsic properties of the network 
as we mentioned above. Then the relative amount of 
intrinsic information in going from s to t is, 



R 1 (s -> t) 



F;{s^t)-S*{s^t) 
F*(s^t) 



(7) 



The quantity i? 7 (s — > t) lies in (0,1] reaching its maxi- 
mum value 1 when 7 —> 1 or 7 —> 00. It has a minimum 
value R* (s —> t) at 7* 6 (1, 00) which we define as the op- 
timal value of 7. At 7* the walker minimizes the relative 
amount of intrinsic information with respect to the whole 
information. It means that, up to minimum point, the 
information the walker gain above the paths to the target 
is preponderating. An increase of 7 further 7* certainly 
implies a gain of useful information, but at lower pace 
than the intrinsic information. Then, the value 7* gives 
insights above the searchability of a network in relation 
with its topology. 



III. SIMPLE EXAMPLES SOLVED 
ANALYTICALLY 

To further clarify the formal ideas introduced in the 
above section let us analyze some simple examples which 
can be solved analytically. We named each example an- 
alyzed in order to facilitate the discussion (see figure [TJ). 
In addition, these examples will provide some insights on 
the problem of targeted delivery of information or navi- 
gation. In particular, the last two examples correspond 
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FIG. 1: Simple targeted walks in different network's envi- 
ronments which can be solved analytically. Unique- way (a), 
Diamond (b), Star web (c) and Equivalent paths (d) (here 
nf = 1-p). 



to extreme cases in which remarkable different topologi- 
cal patterns prevails. On the one extreme, the case where 
only one right path to the target exist; being all the other 
alternative paths dead ends, and in the other extreme, 
the case where there are a lot of similar paths to the tar- 
get -not all of them optimal- with a few shortest paths. 
The penalization scheme behaves differently in each case 
serving as an indicator of which kind of topological pat- 
tern could prevail on real networks or network models. 



A. The Unique— way path 

This example is outlined in figure (Pa). For the sake of 
clarity let us simplify the notation redefining Ey(i — > t) 
by Fj, S(i —> t) by Si and i? 7 (i — > t) by i?;. In this case 
the set of equations © takes the form: 

F s = hi7 + ln2+plnp+ (1 -p)ln(l - p) + (1 - p)F h 
Fi = hi 7 + F s . (8) 

Solving the equation for F s and minimizing with respect 
to p fixing 7 one obtain the following expression for the 
critical p 



P =2 



1 2 7 2 - 1 



V 



(9) 



It is easily to verify that p* — > 1/2 when 7 — > 1 and that 
p* — > 1 when 7 — > 00 therefore one obtain the expected 
limiting cases. The un-penalized case, 7 — > 1, correspond 
to a fully random walker, and the other case 7 — > 00, 
correspond to a walker fully biased towards the shortest 
path. One can see that p* increases when 7 goes from 
1 to 00 indicating that the bias in the walk grows with 
the penalization 7, in other words a larger penalization 
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FIG. 2: Relative amount of intrinsic information R vs. 7 
for the One way path. The curve reach a minimum value 
(R* ~ 0.7) at 7* ~ 2.5 (indicated with a dot) which corre- 
sponds to the optimal degree of penalization. Inset: Optimal 
information S* (full line) and shortest path information S sp 
(horizontal red dotted line). 



leads to a shorter walk. The relative amount of intrinsic 
information corresponding to this example, 



R, = 



(2 7 2 + l)ln7 



(2 7 2 + 1) In 7 - I117 2 + (2 7 2 - 1) In 



(10) 

is plotted as a function of 7 in Fig. O One can see a 
minimum which correspond to an optimal penalization 
7*. Notice that R s is large even at 7*; more than half 
of the total information F* is due to the information 
introduced by 7. The inset shows the information used 
by the walker to reach the target S* as function of 7. It 
increases as 7 increases and converges asymptotically to 
the case of the shortest path S sp = In 2 as 7 — > 00. 



B. The Diamond 

This is a bit more complicated example (Pb). In this 
case the set of equations (j6|) takes the form: 

F.i = In 7 + In 3 + itlnu + wlnw, 

+(1 — u — w) ln(l — u — w) + uF s + wFi, 
F s = In 7 + Fi, (11) 

symmetry arguments where used here to consider that 
Fi = Fj and p = 1/2. Solving for F s and minimizing 
leads to 



w* = — and u* = —r. 

07 3 r y ! 



(12) 



These probabilities tend to the right limits w* , u* — > 1/3 
for 7—5-1 and u*,w* — > for 7 — > 00 obtaining the 



fully random walker and the fully directed walker respec- 
tively. Notice that the probability u of step away from 
t decreases faster with 7 than the probability w of just 
taking a longer path. An optimal penalization 7* ~ 2.18 
exist where i?* ~ 0.77 which again implies a considerable 
fraction of intrinsic information. Also in this case the 
walker's information about the shorter paths S*(s — > t) 
tends asymptotically to the shortest paths information 
S sp (s — y t) = In 3 as 7 increases. It must be remarked 
that if instead of equation (Q| we use the information of 
the shortest paths where stepping back is avoided 2 ^ then 
S*(s — > t) may become even larger. Anyway, the right 
magnitude to compare with is that of equation Eq. ((4]). 



C. The star web 

The star like network (see Fig. Q]c) represent the ex- 
tremal case of only one direct path to the target and a 
large number of dead ends. At variance with the previ- 
ous examples which contains a fixed number of vertices, 
this example has not restrictions in the number of ver- 
tices, allowing the study of quantities that scales with the 
network's size. Like the previous examples, this new one 
can be analytically solved by using the particular sym- 
metries of this network, whatever the size of the network. 
Equations ([6]) in this case reduces to 



F 



Fj = 
F. = 



In 7 + ln(n + 2) + plnp + (1 — p — w) ln(l — p — w), 

w 



+w In — 

n 

U17 + F h 
In 7 + Fi, 



■pFs + wFj, 



(13) 



where Fj 
F s is, 



Fj . The critical probability of 



P = 



2) 7 2 



and 



(n + 2) 7 2 



(14) 



which satisfy the expected limiting cases: p* — > l/(n + 2), 
w* n/(n + 2) for 7 — ► 1 and p*,w* — > for 7 — > 00. In 
this example it is interesting to compute the amount of 
information related to shortest paths S sp and the optimal 
information S* at 7* as function of the number of ver- 
tices n. The following expression is obtained for the first, 
S S p(s —> t) = ln(n + 2), while the second (S*) is obtained 
numerically. Figure [3] shows these quantities as function 
of (n + 2) in a linear log plot. One can see that also S* 
scales logarithmically with the network's size, however 
this amount of information is always smaller than the in- 
formation related to the shortest path and the difference 
between them increases with n. This implies that, as far 
as the optimal walks defined by {q*j} are convenient, it 
is useful to relax the restriction of walking the shortest 
paths, because it is cheaper in terms of information to 
walk along paths which are not so short. Furthermore, 
the relative amount of intrinsic information R* decreases 
with the system size (see the inset of figure [3]). It means 
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FIG. 3: Information as function of the network's size n in 
a linear-log plot, corresponding to the example (c) of Fig. 
[1] Full blue line shortest path information S ap , green dashed 
line optimal information S*(s — > t). Inset: The optimal pe- 
nalization 7* (full black line). Relative amount of intrinsic 
information (red dashed line). Horizontal orange dotted line 
reference value for R*(s — > t) at 0.5. 



that the walker's information about the shorter paths in 
the network eventually becomes predominant. This is 
also consistent with the decreasing in the value of opti- 
mal penalization 7* as function of n, which is required 
to learn the shorter paths (see the inset). This indicates 
that in this topology the walker can learn efficiently the 
ways to the target. 



D. The Equivalent paths 

This example (see figure [T]d) allows to visualize one of 
the main motivations to generalize the approach which 
measures the information considering only shortest paths 
S s p 20]21 to a measure that includes all the possible paths. 
Specifically this example allows the study of a case in 
which the walker has many alternatives consisting of 
equivalent paths, that are not much longer than the 
shortest ones. Due to its particular topology this exam- 
ple can also be solved analytically for arbitrary network's 
sizes. Accordingly, applying Eq. ^ to this particular ex- 
ample the following set of equations turns out, 

F s = In 7 + ln(n + 1) + plnp 

+(l-p) \ n ^—^ + (l-p)F l 
n 

Fi = In 7 + In 2 + u In u 

+(1 - u) ln(l - u) + uF s , (15) 



where Fi := F ix = 
and minimize, leading to 

27 2 (n + l) -n 



P 



= Fi n . As usual we solve for F s 

(16) 



FIG. 4: The optimal information S*(s — > t) as a function of 
system size n (green dashed line) is compared to the shortest 
path's information S sp (s — > i) (full blue line). Both scale 
logarithmically with n as indicated by the straight lines in 
the linear-log graph, and the difference S sp — S* > remains 
constant. Inset: The optimal penalization 7* (full black line) 
grows with n. The relative amount of intrinsic information 
predominates R* > 0.5 for all the sizes (red dashed line). 
The orange dotted line indicates = 0.5. 



and 



1 27 2 (n + 1) - n 
~ 27(7(71 + 1) + 1) 



(17) 



(7(n + l) + l)(n + ^ 



27 2 (n+l) — n 



which satisfies the right limits p* -> l/(n+ 1), u* — > 1/2 
for 7—^1 and p* —> 1 ,w* — > for 7 — > 00. Like in 
the Star web, in this example the optimal information S* 
and the shortest paths information S sp scales logarith- 
mically with the network's size (see figure 2]), and also 
S sp > S* but at variance with Star web the difference be- 
tween them remains almost constant. Here, the optimal 
penalization 7* grows with system size n and the relative 
amount of intrinsic information R* is always predominant 
(see the inset of figure 2]). These results confirm the in- 
tuitive insight that in this kind of topology it is difficult 
for the walker to learn the optimal walk pattern, which 
is related to the fact that discrimination between several 
similar alternatives is expensive. A comparison between 
this example and the previous one reveals important dif- 
ferences connected with differences in their topologies. 
When dead-ends prevail in a network, the optimal paths 
are easily achievable in an efficient way; a small penal- 
ization is enough and the relative amount of intrinsic in- 
formation is not predominant. On the other hand, if 
the alternative paths prevails then the optimization pro- 
cedure is inefficient; a large penalization is required and 
the amount of intrinsic information is the predominating. 
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IV. NUMERICAL SIMULATIONS RESULTS 

In this section we apply the ideas introduced in the 
previous sections to more complex network topologies. 
We choose two paradigmatic cases such as the random 
and scale-free networks. Since these systems cannot be 
solved analytically, all the results we show here are ob- 
tained by numerical simulations. We performed an opti- 
mization procedure minimizing F s with respect to {qtj} 
for a sequence 7 = 5,26,35, ... where 5 is a small quan- 
tity (5 £ [0.005,0.05]). This is a convenient procedure 
since {q*j} varies smoothly with 7. We start using the 
values given by equation as the initial guess for 7 = 5, 
and then we use the last minimum obtained for the sub- 
sequent values of 7. We performed the minimization 
using the implementation of the SLSQ P 24 ' 25 algorithm 
provided by SciPy 2 ^ as a part of the Sage Mathematics 
Software 2 ''. If 5 is too large the minimization algorithm 
fails to converge since the initial guess is too far away 
from the minimum, even in small networks. The average 
walk length is obtained by a Monte Carlo procedure using 
the set {q*j} for the transition probabilities, this quan- 
tity is analyzed as a function of the penalization and the 
shortest path length between the sources and the target. 

The values for the probabilities {q*j} depend on the 
target vertex t but are independent of the source ver- 
tex s for each value of 7. We tested this analytically 
in the examples of figure [TJ and numerically on a small 
random network. Let's define the vector q* whose com- 
ponents are the no null values of {q*j}- In figure[5]we plot 
the ratio between the dispersion of q* with respect to s, 
D(q*) = l/(n - 2)J2 s ^ t l<T( s ) " {<f)\ 2 , and the norm : 
(q*) = l/(n — 1) Y^sjtt Q*( s )> as a function of 7 and for 
different targets t. It is shown that this ratio is much 
smaller than unity confirming the independency of {q*j} 
with respect to the source. 

Since information is an additive quantity, the relative 
amount of intrinsic information for the overall network 
can be defined, given target t. It considers all the possible 
sources and hence the paths to the target t. Consequently 
we have, 



i? 7 (i) = 



(18) 



then the overall optimal penalization 7 t * associated to 
the target t can be obtained from this expression. For 
the sake of brevity lets omit the reference to t so we will 
refer to R 1 (t) by i? 7 and its optimal version R* (t) by R* . 



A. Random networks 

Let us first analyze the case of a random network, all 
the calculations in this section were performed using a 
network of N = 100 vertices with an average degree 
(k) = 3 and using a target chosen at random. Care 
was taken to obtain a random network that consist of 




FIG. 5: Each curve in this figure plots the ratio D/(q*) (see 
text) as a function of 7 for different targets t. One can see 
this ratio remain small in the whole range of 7 we explored. 
The largest value of the ratio is ~ 0.001 (orange curve). The 
calcuations where performed on a random network with n = 
10 and (k) = 3. 



only one connected component. Here we show results 
corresponding to a single realization for the target and 
the network, since similar results were obtained using dif- 
ferent realizations. Figure [5] shows i? 7 as function of 7 
obtained for this network. From this curve we obtained 
the optimal overall penalization 7* ~ 1.105. In this fig- 
ure we also show {R) L which is the relative information 
average over different sources that are at a fixed distance 
L from the target. We obtained from these curves the 
optimal penalization 7^; the dependence of 7^ on L is 
shown in the inset. We observe that 7^ varies with L 
but in every case 7^ is of the same order of magnitude 
that 7*. 

In order to analyze the role of the penalization in the 
restricted walks toward the target, we analyze typical 
walk lengths as function of 7. We first obtain the transi- 
tion probabilities associated to the target t, for a given 7, 
and then using this probabilities we implement a Monte 
Carlo process for obtaining a set of trajectories corre- 
sponding to random walkers which are biased to the tar- 
get. The random walkers start its journey at every possi- 
ble source available in the network. Averaging over these 
trajectories we obtain the average walk length (wl). In 
figure [7] we plot (wl) corresponding to the same realiza- 
tion for the network and target we used for obtaining 
the results of figure [5] When 7 approaches its minimal 
physical value 7 = 1 the walk is fully random and the 
average walk length between all the possible sources and 
the fixed target is (wl) ~ 260 which is at least an order 
of magnitude larger than the typical distance between 
vertices. Then as 7 is increased the average walk length 
decreases drastically. We denote by (L) t the averaging of 
all the shortest path that reach the target (note that in 
this calculation all sources are included) . At the optimal 
penalization 7 t * the difference between the average walk 
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FIG. 6: In black the relative amount of intrinsic information 
Fly associated to a randomly chosen vertex t (equation I18|) . 
It displays the typical behavior with a minimum at -y| ~ 
1.105. Each of the others curves is the relative amount of 
intrinsic information averaged over the different sources at a 
fixed distance L — 1,2,... from t. Inset: The minimum of 
(R) L \ 7£, depends on L, but is the same order as 7* (red 
dotted line). 



FIG. 8: Walker's information (S) averaged on all the sources 
as function of the penalization 7 (full black circle). The hor- 
izontal red dashed line indicates the average shortest path 
information (S ap ). Vertical green full line indicates the opti- 
mal penalization. Inset: Average walker's information (S) L as 
function of the distance L (full magenta circle) and, shortest 
path information (Sap) L (full cyan square). 
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FIG. 7: In black the walk length (wl) averaged over different 
sources for a fixed target t as a function of the penalization 
7. The red dashed line indicates average shortest path length 
(L) t to t. The vertical full blue line indicates the optimal 
value of the penalization 7*. Inset: In full magenta circle, 
the average walk length (wl) L as a function of the distance 
to t. Cyan full squares relative difference ({wl) L — L)/L, the 
orange dotted line indicates the value 2. 

length and the average shortest paths (L) t is nearly two 
times (L) t , i.e. the average walk length is of the same or- 
der of magnitude than the average of the shortest paths. 
We also computed at the optimal penalization the av- 
erage walk length (vol) L that correspond to averages in 
trajectories restricted to start at sources that are at a 
distance L from the target. We plotted this quantity as 
function L in inset of Fig. [7J The average walk length 



(wl) L increases linearly with the shortest path length L, 
but the relative excess {(wl) L — L)/L is almost constant 
taking a value close to 2 (see inset of figure [7]). In order 
to explore the dependency of the walker's information S 
on the penalization we plot in figure [5] the average, (S), 
over all the sources as a function of 7. It can be seen that 
(S) grows with 7 but it is always much smaller than the 
averaged shortest path information (S sp ), in particular 
at the overall optimal penalization 7 t * . In the present ap- 
proach the walker uses less information than in the short- 
est paths approach S sp but there is a price to pay for; the 
path length to the target (wl) is longer than the shortest 
path L. In addition, according to the current approach 
the amount of information the walker learns on average 
(S) L does not depend on L (inset in figure [5]) as far as L 
is greater than 3. One can think that there is a distance 
horizon (L = 3) that define two regimes. At short dis- 
tances the walker can improve its information about the 
paths to the target as the distance grows. Whereas for 
targets far away the sources the amount of information 
cannot be improved. In other words, the searchability at 
short distance is favore d 21 i 28 . 



B. Scale free networks 

Scale free networks are characterized by a power law 
distributions in the connectivity of the vertices and even 
small networks shows the presence of highly connected 
vertices, when compared to the mean value of their con- 
nectivity. Therefore, although in our case the size of 
the network is small, a scale-free topology will allow us 
the study of how the different quantities are affected by 
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FIG. 9: Average walk length {wl) (full black circles) and the 
average shortest path length (L) (full red squares) as a func- 
tion of the target degree k. Both quantities decreases with 
increasing degrees. Inset: The walker's information {S) (full 
magenta circles) and the shortest path information (Ss P ) (full 
cyan squares). 



the vertex's degree. We performed the calculations on 
a Barabasi-Albert network models with N = 100 ver- 
tices and (k) ~ 4. Figure [5] shows that the walk length 
{wl) and the shortest path length {L) decreases with tar- 
get's degree and the decrease of the former is more pro- 
nounced. The hubs can be found easily having a walk 
length much closer to the shortest paths, whereas in 
poorly connected vertices significantly longer walks are 
required. According to the present approach the hubs 
are favored, both regarding the number of steps and re- 
garding the information that is needed; as far as the op- 
timal condition is easily achievable. Like the case of the 
random network, in the scale-free networks the walker's 
information {S) is significantly smaller than the short- 
est path information {S sp ) (see inset of figure [9]). Both 
quantities decrease with the targets degree confirming 
that highly connected vertices are easier to find. Since 
the shortest path information {S sp ) varies more steeply 
than the walker's information {S) with the target's de- 
gree k, then it follows that {S sp ) is more sensitive to 
the topological details than {S) . Finally, figure [10] shows 
that the optimal penalization and the relative amount 
of intrinsic information grows with the target's degree. 
This implies that it is more expensive to find the optimal 
walking pattern in highly connected targets. 



V. DISCUSSION AND CONCLUSIONS 

In this work we have introduced an approach for mea- 
suring the amount of information used by a biased ran- 
dom walker that moves to a target. In this framework we 
extend the ideas of Rosvall et alj22 because we consider 
not only the shortest paths but all the possibles paths 
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FIG. 10: Average relative amount of intrinsic information 
(7?*) as a function of the target degree k (full red circles) and, 
average of the overall optimal penalization {7 t *) as a function 
of target degree k (full red squares). 



to the target. Based in this approach we propose a pe- 
nalization rule, depending on one parameter, that bias 
a random walker to the target, provided the walker can 
use as little information as possible. The basic idea was 
that each step that makes the walker is penalized, hence 
this leads to an overall penalization that tends to reduce 
the walk lengths. Our approach is consistent since the 
two main quantities that determines the cost associated 
to the task of targeted signaling are counterbalanced; a 
shortening of the walk length through penalization im- 
plies an increase of the information required, and vice- 
versa. At this point, it is important to stress that in this 
scheme the penalization operates globally limiting the 
overall available information, unlike other approaches^! 
in which the information at each vertex is limited. This 
has the advantage of overcoming the undesired effect 
of affecting mostly the highly connected vertices, pro- 
vided the vertex's degrees are taken into accoun t 13 ' 21 . 
We also introduced the idea of intrinsic information, in 
order to define an optimal penalization. We have shown 
through some network models that in practice a small 
amount of penalization is enough to drastically reduce 
the typical walk length, and then a network can be ef- 
ficiently navigated with restricted information. On the 
other hand, once the optimal penalization is reached it 
is highly expensive to further reduce the typical walk 
length, in particular an infinite penalization is required 
to restrict the path lengths to the shortest ones. The 
typical walk length in a random network was analyzed 
and compared to the corresponding shortest path at the 
optimal penalization. It is found that the difference be- 
tween these lengths grows linearly with the shortest path 
length. This is connected with a trade-off at which the 
amount of information does not increases with the length 
of the shortest paths. In addition, from the trend of {S) L 
a distance horizon can be identified which define a range 
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of efficient searchability. 

The ideas introduced in this paper were applied to 
undirected and unweighted networks. But, for instance, 
in the study of traffic on cities directed and weighted 
network are needed, since streets has different capacities 
and directions. The extension of the formalism to include 
directed networks is straightforward, but care must be 
taken to ensure that each vertex is accessible from each 
other vertex, otherwise the analysis have to be restricted 
to each strongly connected components of the network. 
Also the penalization scheme may be generalized to be 
vertex dependent. In the case of the dual representation 
of network's cities where vertices are streets and edges are 



road intersections^, the traffic congestion on each street 
can be used to regulate the amount of penalization in 
order to avoid a traffic jam. Although the proposed for- 
malism requires the solution of an optimization problem, 
that has a large computational cost in a serial imple- 
mentation, the algorithm is specially suited for a parallel 
implementation since each target t can be treated sepa- 
rately. 
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